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Abstract 



This article discusses the properties of extremes of degree sequences calculated 
from network data. We introduce the notion of a normalized degree, in order to 
permit a comparison of degree sequences between networks with differing num- 
bers of nodes. We model each normalized degree as a bounded continuous ran- 
dom variable, and determine the properties of the ordered /c-maxima and minima 
of the normalized network degrees when they comprise a random sample from a 
Beta distribution. In this setting, their means and variances take a simplified form 
given by their ordering, and we discuss the relation of these quantities to other 
prescribed decays such as power laws. We verify the derived properties from sim- 
ulated sets of normalized degrees, and discuss possible extensions to more flexible 
classes of distributions. 

> 

^ ; 1 Introduction 

Networks are ubiquitous as models of relational data in science and engineering; their structure is 
described via linkages between the set of nodes of the network. Linkages are commonly assessed in 
terms of which nodes they connect; for example, important nodes may connect to many other nodes, 
while less important nodes may make only a few connections. In this article, we study the propensity 
of the most important nodes for linkage, as assessed in terms of the degree sequence of the overall 
set of network nodes. In particular, we study the properties of the extreme degrees, and discuss their 
moments for a flexible class of (normalized) random degrees based on the Beta distribution. We also 
investigate their properties via simulations, and show how the derived properties relate to observed 



extreme degrees. 



Each linkage between two nodes can either be present or absent, and so is modelled by a Bernoulli 
random variable (independent of all others) taking the value zero or one 0] |2]. We collect the 
links between any two nodes in a matrix A, known as the adjacency matrix, which by necessity is 
symmetric. The sum of all linkages associated with a single node i gives the degree of that node; 
in the simplest sense the degree di indicates the importance of node i to the network O. Starting 
from the matrix A, this means summing the matrix entries over row i to form the vector of degrees 
d = Al, where 1 is the vector of all ones. Common practice is to study the full set of degrees d 
for a network, in order to identify the most important (or least important) nodes; i.e., in this context, 
those that have especially extreme degrees. 

We study the properties of extremes of degree sequences, to understand the degree of diversity that 
can be achieved simply by the act of ordering an identically distributed sequence of random variables 
corresponding to observed degrees. To remove an arbitrary dependence on the size n of the network, 
we rescale each degree by the total number of nodes n to yield a normalized degree, or proportion 
7r,; = di/n. This normalized degree can take a value between zero and one. We therefore model the 
normalized degrees according to a distribution supported on [0, 1]. The normalized degrees can also 
only take quantized values of proportions (i.e., the proportion of other nodes that are connected to 



1 



node i), but for sufficiently large networks, approximating their distribution with that of a continuous 
random variable does not significantly alter the properties of the rescaled degrees. We also note that 
network degrees are mildly correlated under the model specified above, as they are constructed 
from the same random matrix and each degree pair shares one potential linkage. However, as the 
magnitude of this correlation decreases with increasing sample sizes, this effect can safely be ignored 
as n grows large, so that subsequently we treat these random variables as a random sample from a 
given distribution. 

In this setting we study the extremes of the normalized degrees. Rescaled extreme values can, if a 
limiting distribution exists, take one of three possible forms (4] [5]. The choice of rescaling has a 
pivotal role on the properties of the observed extremes. Starting from a bounded density, such as a 
Uniform(0, 6) or (more generally) a Beta(a, 6) density, we will obtain the (reversed) Weibull dis- 
tribution HE]. We show in this case how the expected values of the k largest (or smallest) variates 
take the form of a simple curve in terms of their size ordering k, creating an apparent hierarchy 
of the degree sequence. This effect occurs as a consequence of the behavior of order statistics, de- 
spite the degrees corresponding to a random sample of (in point of fact, approximately) independent 
and identically distributed variates. Similarly, the variances of the k maxima take a simple form 
depending on their ordering. These properties depend on the parameters of the distribution under 
consideration. The parameter governing the Beta distribution near its right tail dominates the expec- 
tation and variance of the maxima, and that governing its left tail dominates those of the minima. 

We conclude by briefly investigating the properties of simulated normalized degree sequences. We 
show for certain choices of Beta parameters a and b how the expected normalized degrees exhibit the 
derived properties over a full range of ordered indices. We then compare these means with a popular 
choice of mean decay — that of a power law (3] — and show that the expected ordered degrees can be 
approximately described in terms of a power law, if we allow for the form of a shifted power law as 
introduced in |6|. 



2 Summary of main results 



Our main results concerning order statistics of normalized observed degree sequences are as follows. 

Suppose 7T(i) > 7T(2) > • • • > 7iv n ) are ordered elements from a Beta(a, b) -distributed random 
sample of size n with parameters a, b > 0, intended to model ordered proportions of degrees divided 
by the number n of nodes in the network. Then: 

1. As n — >• oo, the distribution of n 1 / b (l — 7T( fc )) for fixed k converges to a generalized Gamma 
distribution with scale parameter (/3(a, 6)6) 1 / fc and shape parameters (k 7 b), corresponding 
to the Weibull distribution for k = 1 . 

2. When n is large and k is fixed, the first two moments of ir^) can be expressed as follows: 



E7T (fe) 



r(l/6) (P{a,b)b 



P(k,l/b) 



Var (7T( fe) ) x [l-E7T(fc)j 



1/6 



n 

P{k,l/b) 
p(k + l/b,l/b) 



l-^8(o,6)6- 
n 



1/6 



[1-Ett 



fc + 1/6 



1/6 



The latter approximation to Ktt^ is close to the former for a, b > 0, while the latter 
approximation to Var (^(k)) is close to the former for b > 1, or kb 1 when b < 1. 



3. Analogous asymptotic results hold for ir^ < tt( 2 ) 



< 



< 7T 



(n) 



with P(a, b)b replaced 



by /3(a, b)a and 1/6 replaced by 1/a in the discussion and equations above. To wit, for 
the jth smallest order statistic ftu), the distribution of n 1/,6 7Ty) for fixed j approaches a 
generalized Gamma distribution with scale parameter (/3(a, 6)a) 1 / a and shape parameters 
(j, a), and thus E7T(,-) ~ (j3(a, b)a ■ j/n) 



l/a 



as n — > oo. 



For a = 1, the asymptotic expression in k converges quickly in n and holds throughout the sequence, 
while for 6=1 that in j behaves similarly. These expressions are closest to being exact in the case 
of the uniform distribution (a = 6 = 1), whereupon (as we show below) the expectation of the jth 



2 



smallest sample element is given by E 7Ty) = j/(n + 1), or more generally 9j / (n + 1) when values 
are distributed uniformly on the interval [0,8]. 

3 Exact and limiting distributions of order statistics for normalized degrees 

We are interested in the following question: if tti , . . . 7r„ comprise a random sample from a given 
density f(jr) whose support is contained in [0, 1], and whose tail behavior is specified, then what 
decay do the ordered sample values exhibit as they approach zero or one? Answering this question 
allows us to characterize network nodes that have especially extreme degrees, as described above. 

3.1 The special case of Uniform(0, 9) variates 

We shall develop our understanding of the extreme values of 7T; in a series of steps, starting first 
from simple uniform random variables, and then building up our intuition to more complicated 
distributions. We first recall some basic results for order statistics. We index sample elements from 
smallest to largest as tt(i), ■ ■ ■ 7T(„), with ir^ being the jth smallest, recalling 

F{x)] n -\ 
F{x)] n -!f{x). 

From these expressions we can compute E Xfj\ directly, or via the identity EX {j) = J °°(l - 
Fx u) (x)) dx if supp f(x) C M+. It follows that the expected value of the jth smallest element 7T(j) 
from a Uniform(0, 6) random sample is hence straightforwardly obtained as 

^""(j) = ^ — TT' 3 = 1 ' 2 ' 
n + 1 

3.2 Beta(a, b) order statistics and extreme value theory 

The Uniform(0, 9) case considered above is interesting because we see clear structure in terms of 
the ordering of the random sample. It now becomes sensible to extend such a result to distributions 
that show greater preferential weighting to parts of their range. We recall that the Uniform(0, 1) 
distribution is a special case of the Beta(a, b) distribution for a = b = 1. Obtaining exact results for 
the general case of Beta(a, b) variates is possible for integral (a, b) by way of recursions involving 
higher-order moments of a Beta(a, b) variable El HJ. We may also appeal to extreme value 
theory by way of the cumulative distribution function (CDF) of the Beta density. Continuing in this 
direction, we now investigate the limiting behavior (in n) of order statistics of a random sample 
TV i . . . 7r„ from the Beta(a, b) distribution. 

Choosing constants a n = n x / b and b n = 1 with foresight, we will show that the law of the maximal 
order statistic, suitably transformed to u = a ra (7i7 n ) —b n ), converges to a nondegenerate distribution 
function G(u). We do this as follows. 

1. First, we observe that the CDF F(x) of a Beta(a, b) random variable is given by the regu- 
larized incomplete Beta function 

I x (a,b) = P{a,b)- X [ t a - 1 {l-t) b - 1 dt, 0<x<l, 
Jo 

and so we conclude that the law of max^l^ ~ Beta(a, is given by [I x (a, b)] n . 

2. Second, we must show that the CDF of u goes to some limiting law G{u) as n — > oo, and 
so we must verify that lim„_ i . 00 [/ x (a, b)} n exists. 

We proceed as follows, first considering the case when a and b are positive integers. 



^w=E(j[ f wf[i- 

fx U) (x)=(")j[F(x)Y- 1 [l- 
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3.2.1 Extreme values of Beta(a, b), with a, b e N 

Observe that for a, b G N, the CDF F(x) of a Beta(a, 6) random variable reduces to 



a+6-1 



4(a,6)= 2 ° 



6-1 
i 



x l {l - x) 



a-\-b— 1— i 



o < x < l, a,f»eN, 



and so to simplify calculations we shall commence by assuming that the parameters are integer 
valued. Observe that by the binomial theorem, 



a-l 



a + b-l 
i 



and so 



l = I x (a,b) + J2 

i=0 y 



x\l -x) 



a-\-b— 1— i 



[I x (a,b)} = 



x l (l -x) 



a-\-b—l — i 



a,b e N, 



a, ben. 



Since we are concerned with max^jXi ~ Beta(a, we expand x near 1 in — u as follows: 

u 

x = 1 



,1/6 > 



which corresponds to the choice a n = n 1 / 6 , b n = 1 with respect to the limiting behavior we wish to 
explore. We define the function G n (u) as [/i +Kn -i/i (a, b)] n , and detailed calculations in Appendix 
A yield that G n (u) for a, b € N is given by 



G n (u) 



P{a,b)b 



,1/6 



,1/6 



{1 + fab(u,n)} 



(1) 



where |/ afc (w, n)| < [(a - l)/(6 + 1)] {u/n 1 / b ) (l + M/n 1 / 6 ) a I(a > 2). This yields the CDF of 
the rescaled and shifted random variable — 1), and as we shall see, additional insight can be 

determined from ((T). 



3.3 The limiting form of extreme values 

Now observe that as n grows large, for fixed u we have that G n {u) approaches [1 — 
(/3(a, 6)fe) _1 (— u) b /n] n . Thus for fixed u and as n — > oo, we obtain the limiting form 

" (-u) b ' 

G(u)= lim G„( U )=exp -7^7 , (2) 
n~¥oa p(a,b)b 

and thus we find the (reversed) Weibull or Type III distribution (with the term "Type III" referring 
to standard order statistics terminology) as the limiting law G(u) of u — a„(7T( n ) — b n ), with the 
respective choices of scaling and shift parameters a n = n 1 / 6 and b n = 1 as given above. 

We refer to Appendix [B] for using the limiting form of (f2]i to derive the first moment of each of 
the ordered random variables fttj), indexed from smallest to largest. Appendix [B] shows that these 
take the form of generalized Gamma variates, and their moment characterization will be an initial 
indication of the order statistic structure. We can note directly from Appendix IE1 that 



ETTy) = 



T(l/b) f((3(a,b)b)\ 1/b 



,1/6 



«n-j + l,l/!>) V ' 

Finally, we may appeal to the relation j3{n — j + 1, 1/b) ~ T(l/b)(n — j + l) -1 / 6 for n — j + 1 
large and 1/b fixed, whereupon, reverting to the notation that tt^ denotes the A:th largest weight, 
we obtain the following approximation: 

1/6 



Ett 



(fc) 



1 



0{a,b)b- 



(3) 



This yields a simple and elegant form of the expected value of each order statistic, where the decay 
of the sequence is governed by increasing k. 
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3.4 General result for Beta(a, 6) with a, b > 0, and results for minimum degrees 



Note that by properties of the Beta integral, I x (a, b) = 1 — Ii_ x (b, a), and observe fTDl Chs. 15, 26] 
that (b, a) admits the following series expansion for all real-valued a, b > 0, where we interpret 
r(l — a + m)/r(l — a) as the rising factorial (1 — a) m : 



t (h \- (i-^ y- r(i- a + m) 

ll -*^ a > pub) L> T(l-a)ml(b + m) 

x ' m— 7 



(l-z)' 



0(o,6) ^6 ? ^ i r(l-a)m!(6 + m) 1 ' 
Expanding x near unity in — u as before via x = 1 + u/n 1 ^, we see that 



[J*M)] n = 



i - 



{-u) h /n 
P(a,b)b 



1 + b ^.Wi 



r(l -a + m) 



^ r(l — a)m!(6 + rri) 



[(-u) m /n m / fc ] 



,6>0, 



and thus by the same limiting argument as before, our earlier result holds for all a, b > 0. Moreover, 
we may apply the same logic to look at the minimum order statistic by expanding x near zero in 
u via x = u/n 1 ^, obtaining the Type II limiting law (again with the term "Type II" referring to 
standard order statistics terminology) as G(u) — exp[— ((3(a, 6)a) _1 u a ], implying that E7Ty) ~ 

(/3(o, b)a ■ j /n) 1 /" for the jth smallest order statistic Ktj) . 



3.5 Variance expression 

As a final note, observing that E[(— u^) m ] 
order statistic, we obtain 



(a, b)b] m T(k + m/b)/T(k) for the fcth largest 



Var (-«(*,)) 



G0(a, 6)6) 2 T(fc + 2/6) f(3(a, b)bT{k + 1/6) 



Thus 



Var(7r (fe) ) = — [Ei 



t(fc) 



)] 



T(k) 
[E (-«(*))] 

/3(fc,l/6) 
0(k + l/b,l/b) 



r(fc) 



g(M/6) 

L/3(fc + l/6,l/6) 



[1-E7r (fc) ] : 



fe + 1/6 



1/6 



We see directly that (not unreasonably) the variances of the order statistics decrease as their values 
approach more and more closely to 1 . 



4 Simulation study 

To verify the order statistics properties derived above, we undertook a small simulation study based 
on the log-linear model of ifTTl . Edges in this model are independent Bernoulli trials with success 
probabilities Pij for all 1 < i < j < n. A nonnegative weight u>j is associated to each node i, 
with wi/n, . . .w n /n ~ Beta(a, 6), and we then set pij = WiWj/\\w\\i, where i = Wfc- 
When wf < \\w\\i for all i, it follows that the expected degree E(dj) of the ith node is equal to Wi, 
and thus the unnormalized weights wi, . . .w n can be interpreted as expected degrees. 

Figure Q] overleaf shows two examples of network degrees simulated under this model, with aver- 
age degrees respectively n/10 and n/3. From these examples it can be seen that the approximate 
maximal and minimal order statistic expectations agree well with empirical averages over parts of 
their respective ranges (recalling that these results hold for n large and k, j fixed). In Fig. [If a) with 
Wi/n ~ Beta(l,9), we confirm that the maximal order statistic expression holds throughout the 
sequence. In this case f3(a, 6)6 = 1, further simplifying this expression to E^) xl - (fc/n) 1 ^. 
In Fig. Q~fb) with Wi/n ~ Beta(2,4), we also see reasonable agreement between expected and 
empirical quantities over the initial portions of their respective ranges. 
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(a) 7T ~ Beta(l, 9); E(tt) = 1/10 




2000 4000 2000 4000 



a o o v\ 



0.5 1 



10 



10 



(b) ty ~ Beta(2, 4); E(tt) = 1/3 



Figure 1 : Examples showing simulated network degrees for graphs with n = 4000 nodes, along with 
their empirical and predicted order statistics averaged over 100 trials, for the settings (a) Wi/n = 
TTi ~ Beta(l,9) and (b) Wi/n = TTi ~ Beta(2,4) according to the model of Section|4] The 
leftmost column in each case shows the exact and empirical normalized degree distributions; the 
middle column the maximal order statistics on linear and log-log scales; and the rightmost column 
the minimal order statistics on linear and log-log scales. 



5 Discussion 

In this article we have considered normalized degree sequences generated from the Beta(a, b) family 
of random variables. This led us to be able to derive forms for the expected extrema of such random 
variables, as well as their variances, in order to better understand their sampling characteristics in the 
context of random graphs. As such, we have been able to characterize the expected extreme behavior 
seen in network degrees that exhibit different characteristics due only to random variability, and not 
to difference in model parameters per se. In contrast to this notion of structure due only to ordering, 
we note that a popular model for the extreme expected degrees is the power law (6]: 

(E7r) (fc) = ° c, a, 7 keN. (4) 

Here 7 is the power law exponent, c is an overall scaling, and s is a shift parameter that in combi- 
nation with c controls the maximal and average degree values. If we compare (01 with the expected 
values of the ordered normalized degrees under our model, then from (O we note 

nl/6 



k\ ^ 1 



P(a,b)b 



k < n. (5) 



fcV* 



Comparing to (|5j, we see both similarities and differences in structure. Both expressions are 
decreasing in increasing k, and show a smooth decay in k. The distinction between the two is the 
importance of the order of the shift parameter in each case. If we apply the form of (|5), the initial 
decay in k will be far less dramatic than that of @, as the shift in (0 will by necessity be large and 
harder to dislodge by the power in k. Whether (0]i or (0 more realistically models a given network 
will depend on the application from which the network arose; furthermore, from this comparison we 
see that improper selection of the number of degrees used to estimate the power law exponent can 
potentially lead to erroneous conclusions, especially if the magnitudes of s and c are not investigated 
when fitting the model of dU to an observed network. 

Having compared our results to a power law, a number of other conclusions are apparent. Order 
statistics will create an apparent hierarchy in the normalized degrees. This hierarchy is structured. 
It takes the simplest form if the parameters of the Beta distribution we have considered take special 
values; if not, decay is still monotone and predictable. A palatable feature of normalized degrees as 
we have introduced them is that they are easily comparable across varying network sizes, and scale 
naturally; thus, with varying sample sizes, the same scaling structure is achieved. 
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Our simulation study has shown the practical performance of our results for different choices of nor- 
malized degree distributions. We find good agreement with our theoretical results in the respective 
tails — exactly as predicted by order statistics theory. We know that our limiting results hold only for 
indices near the end of the range; i.e., near the maximum or minimum of the range of the random 
variables under consideration. If we instead wanted to consider degrees taking values nearer to the 
center of the distribution, then it would be necessary to apply other theoretical constructions not 
based on order statistics. In this setting, results would depend differently on the particular choice of 
distribution employed. 
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A Simplifying the limiting form of G n (u) 

We shall simplify the form of G n (u). More formally we write G n (u) as 



G n (u) = 



a + b-l 



,1/6 



,1/6 



a+fc— 1— i 



a,b E N. 



(6) 



Our goal will be to bound the sum near its largest term in the polynomial, which will dominate the 
others. This term is given by [/3(a, fr)fr] _1 [l + u/n 1 / b ] a ~ 1 [— u/n 1 l b ] b , and, factoring out this term, 
we see that the sum in © is given by 



a-2 



/3(a,b)b V ri 1 /* 



l + 0(a, 6)6 £ 



a + b-l 

i 



n 1 / 6 + u 



a — 1 — i 



We may bound this sum via the binomial expansion of [1 — uj (n 1 ^ + u)] a 2 as follows: 



E 

i=0 



a+b-l 



n 1 / b + u 



< 



r(a+6) r(i) 
r(a-i)r(6+2) V^ 1/b +w 



E 



a-2 



n 1 / b + u 



a-l\ {-u)/n 1 / b 
P(a,b)b Vb+l) (i + u /n^ b ) a 



Thus G n (u) for a, b £ N is given by 

1 



G n {u) 



1 



p{a,b)b V nV6 



,1/6 



{1 + fab(u,n)} 



where |/ o6 (u,n)| < [(a - l)/(b + 1)] (-u/n 1 / b ) (l + u/n 1 / b ) 1 a I(a > 2). This will help us 
derive the limiting form of G n (u). 



B Calculating moments of the jth smallest order statistic 

We now switch to standard notation for the Weibull to obtain moments of this limiting form. We are 
interested in the maximum order statistic j = n, as well as others for fixed j not depending on n: 

Eufii E(— U(i)) 

E Hi) = + b « = 1 - !/ ■ 

To obtain these results we use Theorem 1.3.3 of Q, which states that for order statistics of a se- 
quence of independent and identically distributed random variables {7Tj}™ =1 , if for some sequences 
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{a n > 0}, {&„} of real constants, a„(7T(„) — b n ) converges in distribution to a random variable with 
nondegenerate distribution function, then for any fixed positive integer j we have that 

P (a n { Hj) b n ) <x)^ G{x) g ( ' l0g .f (x)r , 

with the same a n , b n . We remark that the formula is for the jth smallest, so that the maximum j — n 
will involve only one term — the sum in this case terminating after the zeroth term. 

To employ this result we must evaluate the above expression for the Type III law in question. 
The corresponding probability density function (PDF) is the Weibull(6, A) distribution, where 
b e N, A = (f3(a, b)b) x / b in our previous notation: 

F(x;b,X) = 1 -cxp[-(x/A) b ], x > 0; 
f(x; b,X) = j (j exp[-(x/\) b ], x > 0; 
EX m = A"T (l + y) • 
Now observe that we may express the corresponding limiting law of u^y the jth smallest variable, 

Gu w («) = G{u) 2^ - 



as: 



i=0 



where G(u) = cxp [-(-u) b /(P(a, 6)6)] = cxp[-(-?i/A) fc ], as 

n ~3 ( I \\ib 

G U(j) (u) = cx P [-(- u /A) b ] J2 ~^V^- 

i=0 



T(n - j + 1) A V A 

s T(n- j + l + m/6) A m r(m//c) 



It follows that the limiting PDF g U(j) (u) is given by the generalized Gamma distribution: 

1 b f -u\ ( - n ~ j+1 ^ b ~ 1 

9u u) ( u ) = Yh TT-^-\ — ) exp[-(- U /A) fc ], u < 0; 

E ([-«Ci)D = A ' T(n-j + l) ~ 0(n-j + l,m/by 
where the latter expression follows as the moments of the generalized Gamma are known. 
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